

REPORT DOCUMENTATION PAGE    Form Approved, OMB No. 0704-0188


3. REPORT TYPE AND DATES COVERED

Final Report, 01 May 88 thru 30 Sep 89


4. TITLE AND SUBTITLE    5. FUNDING NUMBERS

MASSIVELY-PARALLEL COMPUTATIONAL FLUID DYNAMICS    AFOSR-88-0212

61102F  2304/A2 


1. AGENCY USE ONLY (Leave blank)


6. AUTHOR(S)

D.  A.  Calahan 


7. PERFORMING ORGANIZATION NAME(S) AND ADDRESS(ES)

University  of  Michigan 
Department  of  Electrical  Engineering 
and  Computer  Science 
Ann  Arbor,  MI  48109-1092 


8. PERFORMING ORGANIZATION
REPORT NUMBER


AFOSR-TR-90-0079


9. SPONSORING/MONITORING AGENCY NAME(S) AND ADDRESS(ES)

AIR FORCE OFFICE OF SCIENTIFIC RESEARCH
Building 410
Bolling AFB, DC 20332-6448


10.  SPONSORING  /  MONITORING 
AGENCY  REPORT  NUMBER 


AFOSR-88-0212 


12a.  DISTRIBUTION  /  AVAILABILITY  STATEMENT 

Approved for public release;
distribution unlimited.


13. ABSTRACT (Maximum 200 words)




12b.  DISTRIBUTION  CODE 


Algorithm development. Implementation of a 3-D Navier-Stokes implicit
research code from AFFDL was initiated. This required the parallel
solution of block-tridiagonal systems and so was more challenging than
the above explicit code. This is being continued in a new AFOSR grant.
Connection Machine experiments. To keep abreast of SIMD architecture
performance, a series of Fortran and C kernels were studied on the
Argonne Laboratory CM-2 during the summer of 1989. It was concluded
that the available Fortran 8X implementation on the CM-2 was too
inefficient to warrant continued study at that time.



17.  SECURITY  CLASSIFICATION 
OF  REPORT 

UNCLASSIFIED 


NSN 7540-01-280-5500


18.  SECURITY  CLASSIFICATION 
OF  THIS  PAGE 

UNCLASSIFIED 


16. PRICE CODE


19. SECURITY CLASSIFICATION    20. LIMITATION OF ABSTRACT
OF ABSTRACT


UNCLASSIFIED 


Standard Form 298 (Rev. 2-89)
Prescribed by ANSI Std. Z39-18


Final  Technical  Report 
for  Period  5-1-88  to  9-30-89 


Grant  AF-AFOSR-88-0212 


MASSIVELY-PARALLEL COMPUTATIONAL FLUID DYNAMICS


D.  A.  Calahan 
Principal  Investigator 
Laboratory  for  Scientific  Computation 
Department  of  Electrical  Engineering  &  Computer  Science 

University  of  Michigan 


Ann  Arbor,  MI 


October  30,  1989 




I.  Progress  Report 


Goal 

The  research  is  intended  to  develop  methodologies  for  specifying 
major  machine  design  parameters  for  distributed  parallel 
architectures  useful  in  CFD.  The  effort  has  two  major  components. 

(1)  Gain  algorithm  experience  in  conversion  of  a  suite  of 
Air  Force  production  and  research  CFD  codes  to  a  general  format 
applicable  to  a  variety  of  such  commercial  architectures. 

(2)  Relate  critical  code  execution  timing  features  to  the 
machine  parameters. 


Progress  (5/88-9/88) 

(1) Algorithm development [2]. A full 3-D Navier-Stokes
explicit production code from AFFDL was implemented on a 1024-node
scalar NCUBE hypercube at Sandia (Albuquerque).

(2) Distributed-memory architectures for CFD [1][3]. Timing
models of this code were developed in terms of four critical
design specifications: MFLOP execution rate, message startup time,
message transfer time, and local memory size. These models were
intended to permit a graphical specification of hardware tradeoffs
to achieve given levels of parallelization efficiency.
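
The timing models themselves are given in the cited references and are
not reproduced here. As an illustration only, a minimal sketch of how
the four design specifications could enter a per-time-step model is
shown below. The class names, the fields, and the assumption that
communication does not overlap computation are all hypothetical and
are not taken from the report.

```python
from dataclasses import dataclass


@dataclass
class NodeSpec:
    """Hypothetical per-node design parameters (the four named in the text)."""
    mflops: float               # sustained execution rate, MFLOP/s
    msg_startup_s: float        # fixed cost to start one message, seconds
    transfer_s_per_byte: float  # incremental cost per byte sent, seconds
    local_memory_bytes: int     # memory available on the node


@dataclass
class StepWorkload:
    """Hypothetical per-node workload for one time step of the code."""
    flops_per_node: float
    messages_per_node: int
    bytes_per_message: int
    resident_bytes_per_node: int


def step_time(spec: NodeSpec, work: StepWorkload) -> tuple[float, float]:
    """Return (compute seconds, communication seconds) for one step."""
    compute = work.flops_per_node / (spec.mflops * 1e6)
    per_message = spec.msg_startup_s + work.bytes_per_message * spec.transfer_s_per_byte
    comm = work.messages_per_node * per_message
    return compute, comm


def efficiency(spec: NodeSpec, work: StepWorkload) -> float:
    """Fraction of the step spent computing, assuming no overlap of
    communication with computation (an assumption of this sketch)."""
    if work.resident_bytes_per_node > spec.local_memory_bytes:
        raise ValueError("subdomain does not fit in local memory")
    compute, comm = step_time(spec, work)
    return compute / (compute + comm)
```

Sweeping one field of NodeSpec while holding the others fixed and
plotting efficiency() would give one curve of the kind of hardware
tradeoff chart the text describes.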

Progress (10/88-9/89) 

(1) Algorithm development. Implementation of a 3-D Navier-Stokes
implicit research code from AFFDL was initiated. This required the
parallel solution of block-tridiagonal systems and so was more
challenging than the above explicit code. This work is being
continued under a new AFOSR grant.

(2) Connection Machine experiments. To keep abreast of SIMD
architecture performance, a series of Fortran and C kernels were
studied on the Argonne Laboratory CM-2 during the summer of 1989.
It was concluded that the available Fortran 8X implementation on
the CM-2 was too inefficient to warrant continued study at that
time.
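
The report does not state which method was used for the
block-tridiagonal systems in item (1). As background only, the sketch
below shows the standard sequential block Thomas elimination, written
in plain Python with hypothetical helper names. Each forward step uses
the result of the previous block row, and this recurrence is what makes
a parallel solution harder than the explicit update.

```python
def mat_mul(a, b):
    """Dense matrix product of two lists-of-rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]


def mat_sub(a, b):
    """Elementwise difference a - b."""
    return [[x - y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]


def solve_dense(a, rhs):
    """Solve a @ x = rhs (rhs is a matrix) by Gaussian elimination
    with partial pivoting."""
    n = len(a)
    m = [list(a[i]) + list(rhs[i]) for i in range(n)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        if m[pivot][col] == 0.0:
            raise ValueError("singular block")
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for c in range(col, len(m[r])):
                m[r][c] -= f * m[col][c]
    width = len(m[0]) - n
    x = [[0.0] * width for _ in range(n)]
    for r in range(n - 1, -1, -1):  # back substitution
        for j in range(width):
            s = m[r][n + j] - sum(m[r][c] * x[c][j] for c in range(r + 1, n))
            x[r][j] = s / m[r][r]
    return x


def block_thomas(lower, diag, upper, rhs):
    """Solve lower[i] x[i-1] + diag[i] x[i] + upper[i] x[i+1] = rhs[i].

    lower[0] and upper[-1] are never read. Each rhs[i] and each returned
    x[i] is a column stored as a b-by-1 list of rows.
    """
    n = len(diag)
    c_prime = [None] * n
    d_prime = [None] * n
    for i in range(n):  # forward elimination: step i needs step i-1
        if i == 0:
            pivot_block, r = diag[0], rhs[0]
        else:
            pivot_block = mat_sub(diag[i], mat_mul(lower[i], c_prime[i - 1]))
            r = mat_sub(rhs[i], mat_mul(lower[i], d_prime[i - 1]))
        if i < n - 1:
            c_prime[i] = solve_dense(pivot_block, upper[i])
        d_prime[i] = solve_dense(pivot_block, r)
    x = [None] * n
    x[n - 1] = d_prime[n - 1]
    for i in range(n - 2, -1, -1):  # back substitution over block rows
        x[i] = mat_sub(d_prime[i], mat_mul(c_prime[i], x[i + 1]))
    return x
```

No pivoting is done across block rows, so the sketch assumes the
reduced diagonal blocks stay nonsingular, as they do for block
diagonally dominant systems.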


II. Coupling Activities


Air  Force  Flight  Dynamics  Laboratory 

Several of the codes under study to illustrate the architectural
modeling process were obtained from Dr. Joseph Shang, director of
the Computational Aerodynamics Group at AFFDL. Five visits were
made to his laboratory to plan the coordination of our research
efforts. A four-processor NCUBE massively-parallel algorithm
preparation workstation was loaned to his group to assist in these
joint efforts.

Near the end of the grant period, it was decided during a visit
that a moderately-parallel message-passing network of workstations
would be of immediate interest to Dr. Shang's group; further, it
appeared that the above-mentioned parallelized CFD code would be
directly applicable to such an architecture. Work on this research
is continuing under new AFOSR-sponsored funding.